Chaotic principle: an experimental test 



F. Bonetto*, G. Gallavotti†, P. L. Garrido‡
* Matematica, Iª Università di Roma, P.le Moro 2, 00185 Roma, Italia
† Fisica, Iª Università di Roma, P.le Moro 2, 00185 Roma, Italia
‡ Instituto Carlos I de Física Teórica y Computacional, Universidad de Granada, E-18071 Granada, España

Abstract: The chaotic hypothesis discussed in [GC1] is tested experimentally in a simple conduction model. Besides a confirmation of the hypothesis predictions the results suggest the validity of the hypothesis in the much wider context in which, as the forcing strength grows, the attractor ceases to be an Anosov system and becomes an Axiom A attractor. A first test of the new predictions is also attempted.

§1. Introduction.

A principle holding when motions have an empirically chaotic nature was introduced in reference [GC2]:



Chaotic hypothesis: A many particle system in a stationary state can be regarded as a smooth dynamical system with a transitive¹ Axiom A global attractor for the purpose of computing macroscopic properties. In the reversible case it can be regarded, for the same purposes, as a smooth transitive Anosov system.



For an informal discussion of the properties of Anosov systems relevant for this work, in particular for the "Boltzmannian representation" of the SRB distribution possible for them, see [GC2], [G1]. See [AA], [Sm] for a general discussion of the basic geometrical ideas and [S], [R1], [R2], [Bo] for the original and complete descriptions of the mathematical notion and properties of Anosov and Axiom A systems.

The results of this work mainly concern the reversible Anosov case: in the concluding remarks we discuss various questions related to reversibility and to the Axiom A cases. Therefore the part of the hypothesis that concerns reversible systems is essential for our applications. It can be rephrased in various ways and by doing so one gains some insights into its meaning: see §6.

This implies that the macroscopic time averages are described by a probability distribution $\mu$ on the "phase space" $C$ of observed events, also called timing events (which could be, for instance, the occurrence of a microscopic binary "collision"). The time evolution, or the dynamics, is a map $S$ of $C$ into itself. The map $S$ is derived from the flow $Q_t$ that solves the differential equations of motion of the system: the timing events $C$ have to be thought of as a surface transversal to the flow and, if $t(x)$ is the time between the timing event $x$ and the successive one $Sx$, then $Q_{t(x)}x = Sx$. Note that the points $Q_t x$ are not timing events (i.e. they are not in $C$) for the intermediate times $0 < t < t(x)$.² We call $Q_t$ the continuous time evolution and $S$ the discrete time evolution.

¹ The notion of transitivity used in the above reference, and in the related ones, is that the stable and unstable manifolds of each point of the attractor are dense on the attractor.

² One may wonder why we do not time the observations at constant pace: this would indeed be possible. It is however convenient to time the observations on natural events (i.e., using the jargon, "to make observations on a Poincaré's

The existence of the distribution $\mu$ is assumed in general, as stated by the following (extension) of the zeroth law, [UF], giving a global property of the motions generated by initial data chosen randomly with distribution $\mu_0$ proportional to the volume measure on $C$:

Extended zero-th law: A dynamical system $(C,S)$ modeling a many particle system (or a continuum such as a fluid) describes motions that admit a statistics $\mu$ in the sense that, given any (smooth) macroscopic observable $F$ defined on the points $x$ of the phase space $C$, the time average of $F$ exists for all (randomly chosen) initial data $x$ and is given by:

$$\lim_{M\to\infty}\frac{1}{M}\sum_{k=0}^{M-1} F(S^k x) = \int_C \mu(dx')\,F(x') \qquad (1.1)$$

where $\mu$ is an $S$-invariant probability distribution on $C$.
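As an illustration of what (1.1) asserts, the following minimal sketch compares a time average with a phase space average for Arnold's cat map on the torus, a textbook Anosov map whose invariant distribution is the area. The map, the observable and the initial datum are illustrative assumptions of ours and are not part of the conduction model studied in this paper.

```python
import math


def cat_map(x, y):
    """One step of Arnold's cat map on the unit torus (illustrative Anosov map)."""
    return (2.0 * x + y) % 1.0, (x + y) % 1.0


def time_average(observable, x, y, steps):
    """Left hand side of (1.1) at finite M: (1/M) * sum_{k<M} F(S^k x)."""
    total = 0.0
    for _ in range(steps):
        total += observable(x, y)
        x, y = cat_map(x, y)
    return total / steps


def observable(x, y):
    # Smooth observable; its average with respect to the area measure is 1/2.
    return math.cos(2.0 * math.pi * x) ** 2


# Hypothetical initial datum, standing for a "randomly chosen" point.
avg = time_average(observable, 0.1234567, 0.7654321, 200_000)
space_average = 0.5
print(avg, space_average)
```

The finite time average should approach the phase space average as the number of steps grows; no precise rate is claimed here.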

The chaotic hypothesis was proposed by Ruelle in the case of fluid turbulence, and it is extended to non equilibrium many particle systems in [GC1]. If one assumes it, then it follows that the zeroth law holds, [S], [Bo], [R1]; however it is convenient to regard the two statements as distinct because the hypothesis we make is "only" that one can suppose that the system is Anosov for "practical purposes": this leaves the possibility that it is not strictly speaking such, and some corrections ("negligible in the thermodynamic limit") may be needed on the predictions obtained by using the hypothesis.

In [GC2] the generality of the hypothesis is discussed and in [GC1], [GC2], we derived, as a
rather general consequence, predictions testable at least by numerical or physical experiments 
in systems with few degrees of freedom. The feature of the prediction relevant for numerical 
experiments, a large deviation theorem or fluctuation theorem, is that it is parameter free; other 
results concern the Onsager reciprocity in various classes of mechanical systems [G5], or fluid 
models [G4]. 

The theory was developed to understand the results in [ECM2] which, therefore, provided also 
the first test. In this paper we present the results of numerical experiments that we conducted 
in order to check the hypothesis in models different from the shear flow model in [ECM2].

Being a rather general principle, the chaotic hypothesis yields predictions that are sharp and 
inescapable, without free parameters. Hence it is important to check it with the highest pre- 
cision possible. This immediately leads one to work at the limit of the present day computer 
capabilities and to lengthy data elaboration. 

In §2 we describe the models and give a quantitative description of their rough characteristics, discussing the experiments that we perform. In §3 we explain, through an analogy with well known Ising model properties, the mechanism which allows us to acquire exact knowledge of some properties of $\mu$ without actually computing $\mu$ itself (which might be a surprising "achievement" if one does not examine [GC1], [GC2] in some detail). In §4 we present the raw experimental results; in §5 and in the Appendix we briefly summarize the methods followed in the statistical analysis of the results. In §6 we present comments, plans and perspectives, and some challenging pictures that seem to emerge from [G4], [G5], [BG] and from the present paper.



section") so that one eliminates one degree of freedom as well as the corresponding (trivially zero) Lyapunov exponent, 
as recognized very early, [L]. 






§2. The models and a description of the experiments.



The models contain thermostat mechanisms in order to enable the systems to reach a non equilibrium stationary state in the presence of an imposed external field: therefore they are related to electrical conductivity problems. They represent a gas of $N$ identical particles with mass $m$, interacting via a hard core pair potential $\varphi$ with radius $r$ and with an external potential $\varphi^e \ge 0$. The gas is enclosed in a 2 dimensional box $[-\frac12 kL, \frac12 kL]\times[-\frac12 \ell L, \frac12 \ell L]$, $k,\ell = 1,2,\ldots$, and is subject to periodic boundary conditions and to a horizontal constant external field $E\,\mathbf{i}$ ($\mathbf{i}$ is a unit vector in the $x$-direction). The external potential will also be just a hard core interaction excluding access to the area covered by some obstacles or forbidding the crossing of the box walls. The obstacles are hard disks with centers situated on two square lattices with spacing $L$ shifted by $\frac12 L$ relative to each other. The radii of the disks of each sublattice are equal and respectively given by $R_1$ and $R_2$, so fixed that every rectilinear trajectory must suffer collisions with them. An alternative setting could have been a collection of identical hard disks (with large radii) with centers on a triangular lattice: the adopted geometry is the same as that of the previous paper [GG].

The geometry is very simple and the position space is described in Fig. 1:

(2.1)

Fig. 1: General billiard structure with scatterers of radius $R_1$ and $R_2$ in a periodic box with side length $L$ (case $k\times\ell = 1\times 1$).

The box $[-\frac12 kL, \frac12 kL]\times[-\frac12 \ell L, \frac12 \ell L]$ consists of $k\cdot\ell$ unit lattice cells with side $L$ joined to form a square box: at the box boundary we impose periodic boundary conditions (pbc) or, alternatively, semi periodic boundary conditions (½pbc): in the latter case the "horizontal" box walls are reflecting. The experiments with different boundary conditions have been performed "completely" independently, on different machines and with different codes.

The system is in contact with a "thermostat" which adds (or subtracts) energy so that the total internal energy stays rigorously constant. The equations of motion are:

$$\dot{\mathbf q}_j = \frac{1}{m}\mathbf p_j,\qquad \dot{\mathbf p}_j = \mathbf F_j + E\,\mathbf i - \alpha(\mathbf p)\,\mathbf p_j \qquad (2.2)$$

with $j = 1,\ldots,N$; $\alpha(\mathbf p) = E\,\mathbf i\cdot\sum_j \mathbf p_j / \sum_j \mathbf p_j^2$ and $\mathbf F_j$ is the (impulsive) force acting on particle $j$ (which is due only to the hard cores). The $\alpha$-term incorporates the coupling to a "Gaussian thermostat" and it is assumed to obey Gauss' "principle of least constraint", see [LA]. The constraint here is the constancy of the internal (i.e. kinetic) energy:

$$H = \sum_{j=1}^{N}\frac{\mathbf p_j^2}{2m} \qquad (2.3)$$

a typical nonholonomic constraint; it follows then from Gauss' principle that the force corresponding to the constraint is proportional to the gradient with respect to $\mathbf p_j$ of $H$. This model has been studied in great detail, in [CELS], in the case $N = 1$; a similar model has been investigated numerically in [BEC], and very recently in [DPH], [DM]. It is part of a wide class of models, see [GC1], [GC2], whose interest for the theory of non equilibrium stationary states was pointed out in [HHP], [PH], where one can find the first studies performed in the context in which we are interested.
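The role of the thermostat term can be made concrete with a short numerical check. The sketch below is only an illustration under stated assumptions: it evaluates the multiplier $\alpha(\mathbf p) = E\,\mathbf i\cdot\sum_j\mathbf p_j/\sum_j\mathbf p_j^2$ for a hypothetical set of momenta and verifies that, between collisions, the kinetic energy (2.3) does not change under $\dot{\mathbf p}_j = E\,\mathbf i - \alpha\,\mathbf p_j$. The function names and the sample momenta are ours.

```python
def thermostat_alpha(momenta, field):
    """alpha(p) = E i . sum_j p_j / sum_j p_j^2, with i the unit vector along x."""
    total_px = sum(px for px, _ in momenta)
    total_p2 = sum(px * px + py * py for px, py in momenta)
    return field * total_px / total_p2


def kinetic_energy_rate(momenta, field, mass=1.0):
    """d/dt of sum_j p_j^2 / (2m) under dp_j/dt = E i - alpha p_j (no collisions)."""
    alpha = thermostat_alpha(momenta, field)
    rate = 0.0
    for px, py in momenta:
        force_x = field - alpha * px   # field acts along x only
        force_y = -alpha * py
        rate += (px * force_x + py * force_y) / mass
    return rate


# Hypothetical momenta of N = 3 particles, for illustration only.
momenta = [(0.3, -0.8), (1.1, 0.4), (-0.5, 0.9)]
rate = kinetic_energy_rate(momenta, field=1.0)
print(rate)
```

The rate vanishes up to rounding because the work done by the field, $E\,\mathbf i\cdot\sum_j\mathbf p_j/m$, is exactly removed by the $\alpha$-term.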

The timing events that we choose to follow are simply the collisions: whether with the walls or with the obstacles or with other particles. The boundary conditions will be periodic in both directions or reflecting in the one perpendicular to the field ("vertical") and periodic in the other ("horizontal").

The initial data will be fixed by a random choice with absolutely continuous distribution on the full phase space $\mathcal F$ (i.e. on the full energy surface).

The dimension of the phase space $\mathcal F$ of this system is that of the energy surface, i.e. $4N-1$, and that of the set of timing events $C$ is $2D$ with $D = 2N-1$, i.e. one unit less than the dimension of $\mathcal F$. The phase space "contraction" rate, i.e. the divergence of the right hand side of (2.2), is $\gamma(x) = D\alpha(x)$. It can be written in the form:

j{x) = Da{x) = , 2 

where 'W = X]j P average momentum, e{x) is the work done on the system per unit time 

by the external field and ^UbT^x) is -^{p ^ — P")^ which, if ks is Boltzmann's constant, 

defines the kinetic temperature: hence the name of entropy production rate per (kinetic) degree of freedom that will be occasionally given to $\gamma(x)$. Note that $\gamma(x)$ does not have a definite sign.
In dimension d>2 the factor |^ would become 
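The relation between the phase space contraction rate and $D\alpha$, $D = 2N-1$, quoted above can be checked numerically. The sketch below is an illustration under stated assumptions: it takes the momentum part of (2.2) between collisions (the impulsive forces are ignored, and $\dot{\mathbf q}_j = \mathbf p_j/m$ contributes no divergence), computes its divergence by central finite differences and compares minus the divergence with $(2N-1)\alpha$. The function names, the step `h` and the sample momenta are hypothetical.

```python
def thermostat_alpha(flat_p, field):
    """alpha for momenta stored as [p1x, p1y, p2x, p2y, ...]; field along x."""
    total_px = sum(flat_p[0::2])
    total_p2 = sum(c * c for c in flat_p)
    return field * total_px / total_p2


def momentum_force(flat_p, field):
    """Right hand side of dp/dt = E i - alpha(p) p, component by component."""
    alpha = thermostat_alpha(flat_p, field)
    return [(field if k % 2 == 0 else 0.0) - alpha * c
            for k, c in enumerate(flat_p)]


def divergence(flat_p, field, h=1e-6):
    """Divergence with respect to the momenta, by central finite differences."""
    div = 0.0
    for k in range(len(flat_p)):
        plus = list(flat_p)
        minus = list(flat_p)
        plus[k] += h
        minus[k] -= h
        div += (momentum_force(plus, field)[k]
                - momentum_force(minus, field)[k]) / (2.0 * h)
    return div


flat_p = [0.3, -0.8, 1.1, 0.4, -0.5, 0.9]   # hypothetical, N = 3
n_particles = len(flat_p) // 2
alpha = thermostat_alpha(flat_p, 1.0)
contraction = -divergence(flat_p, 1.0)       # phase space contraction rate
reversed_alpha = thermostat_alpha([-c for c in flat_p], 1.0)
print(contraction, (2 * n_particles - 1) * alpha, reversed_alpha)
```

Reversing all momenta changes the sign of $\alpha$, which is one way to see that the contraction rate has no definite sign.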

The above contraction corresponds to a contraction of the volume in the full phase space between one collision and the next, given by $e^{-t_0\sigma(x)}$ with:

$$\sigma(x) = \frac{1}{t_0}\int_0^{t(x)}\gamma(Q_t x)\,dt \qquad (2.5)$$

if $Q_t$ is the continuous time evolution (see §1) and $t_0$ denotes the mean collision time $t_0 = \langle t(\cdot)\rangle_+$. We shall be concerned mostly with the contraction rate $\sigma_\tau(x)$ occurring during $\tau$ time steps as the system evolves between $S^{-\tau/2}x$ and $S^{\tau/2}x$:

$$\sigma_\tau(x) \overset{def}{=} \frac{1}{\tau}\sum_{j=-\tau/2}^{\tau/2-1}\sigma(S^j x) \qquad (2.6)$$

It has been proved, [CELS], that for $N = 1$ and small $E \ne 0$ the average $\langle\sigma\rangle_+$ is positive, i.e. the system is dissipative. There seems to be no reason to think that $\langle\sigma\rangle_+$ is not positive when $N > 1$ and our experiments show that indeed this seems to be the case.

Recently, in fact, it has been shown that under very general conditions (essentially under the assumption that the extended zero-th law holds) it must be $\langle\sigma\rangle_+ \ge 0$, [R3]. In the latter paper it is also shown that $\langle\sigma\rangle_+ > 0$ if the SRB distribution $\mu$ gives probability 1 to a set which has zero Liouville measure (i.e. if the attractor is "really" smaller than the full phase space).³

Therefore it is natural to write:

$$\sigma_\tau(x) = \langle\sigma\rangle_+\,p(x) \qquad (2.7)$$

so that the contraction of the phase space volume, while the system evolves between $S^{-\tau/2}x$ and $S^{\tau/2}x$, is $e^{-\tau t_0\langle\sigma\rangle_+ p(x)}$ and $\langle p\rangle_+ = 1$.

In this case the time reversal map $i$ defined by $i: (\mathbf q, \mathbf p)\to(\mathbf q, -\mathbf p)$ is such that $\sigma_\tau(ix) = -\sigma_\tau(x)$.

³ We call attractor a set $G$ with minimal (Hausdorff) dimension which has the property that $\mu(G) = 1$, i.e. which has probability 1 in the stationary state. In general the closure clos($G$) of the attractor may be the whole space $C$ while the dimension of $G$ may be much less.

The number $k\cdot\ell$ of unit lattice cells forming the box containing the gas will be called the size of the box. We define the density as $\rho = \frac{N}{k\ell L^2}$, the energy density as $e_0 = \frac{1}{N}\sum_j\frac{\mathbf p_j^2}{2m}$ and we take units so that $m = 1$, $e_0 = 1/2$, $L = 1$. The properties of the system are thus governed by the values of the parameters $R_1$, $R_2$, $r$ and $\rho$; in place of $\rho$ one could use the occupied volume $\delta = \frac{N V_0}{V - V_{obs}}$ where $V_0$ is the particle volume, $V = k\ell L^2$ is the box volume and $V_{obs}$ is the total volume of the obstacles, so that $\delta$ is the ratio between the volume occupied by the particle cores and the free volume where they can roam (i.e. the volume of the cell outside the volume $V_{obs}$ occupied by the obstacles).

The field intensity $E$ is fixed to $E = 1$ in all experiments. We shall consider systems with $N = 2$, $N = 10$, radii $R_1 = 0.2$, $R_2 = 0.4$ and $r = 0.005$ or $r = 0.01$ depending on the type of boundary conditions denoted, above, by pbc and ½pbc. Here is a list of the above kinematic quantities in the cases that we shall consider:



quantity            symbol          value
density             $\rho$          $N/V$
energy density      $e_0$           1/2
mass                $m$             1
box side            $L$             1
obstacles radii     $R_1, R_2$      0.2, 0.4
particles radii     $r$             0.005 (pbc) or 0.01 (½pbc)
occupied volume     $\delta$        $N V_0/(V - V_{obs})$
size                $k\times\ell$   1 × 1 (pbc) or 2 × 2 (½pbc, $N = 2$) or 4 × 5 (½pbc, $N = 10$)
particles number    $N$             2 or 10
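The density and the occupied volume in the four cases can be recomputed from the kinematic quantities listed above. The sketch below is a minimal illustration under one assumption that we state explicitly: each unit cell contains one obstacle of radius $R_1$ and one of radius $R_2$ (as suggested by the two shifted square sublattices of spacing $L$), so that $V_{obs} = k\ell\,\pi(R_1^2 + R_2^2)$. The function names are ours.

```python
import math


def density(n_particles, k, l, side=1.0):
    """rho = N / V with V = k * l * L^2."""
    return n_particles / (k * l * side * side)


def occupied_volume(n_particles, r, k, l, side=1.0, r1=0.2, r2=0.4):
    """delta = N V0 / (V - V_obs) for hard disks of radius r.

    Assumes one obstacle of each radius per unit cell.
    """
    cells = k * l
    box_volume = cells * side * side
    obstacle_volume = cells * math.pi * (r1 ** 2 + r2 ** 2)
    particle_volume = math.pi * r ** 2
    return n_particles * particle_volume / (box_volume - obstacle_volume)


cases = {
    "pbc, N=2": (2, 0.005, 1, 1),
    "1/2 pbc, N=2": (2, 0.01, 2, 2),
    "pbc, N=10": (10, 0.005, 1, 1),
    "1/2 pbc, N=10": (10, 0.01, 4, 5),
}
for name, (n, r, k, l) in cases.items():
    print(name, density(n, k, l), occupied_volume(n, r, k, l))
```

With these inputs the occupied volume comes out close to $4.2\cdot 10^{-4}$ in three cases and close to $2.1\cdot 10^{-3}$ for pbc with $N = 10$.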



The following dynamical quantities are particularly interesting for the qualitative picture of the motions.

• The average timing $t_0$ of the collisions, equal to the future average $\langle t(\cdot)\rangle_+$ of the time $t(x)$ elapsing between two successive collisions.
• The collision rate $\nu$ will be the number of collisions between moving particles divided by the total number of collisions including the ones with the obstacles and the walls (the latter are present only in the ½pbc case).
• The average entropy creation per collision, equal to the future average $t_0\langle\sigma\rangle_+$.
• The Lyapunov exponents $\lambda_{max}$ and $\lambda_{min}$ defined, respectively, by the largest expansion rates of line elements under the action of the positive iterates of the evolution map $S$ or by the minimum absolute value of the expansion or contraction rates.
• The entropy correlation time $\vartheta^{-1}$ defined by the decay rate $\langle\sigma_\tau(S^n x)\sigma_\tau(x)\rangle_+ - \langle\sigma_\tau\rangle_+^2 = O(e^{-\vartheta n})$ of the entropy autocorrelation (recall that $Q_t$ denotes the continuous time evolution, see §1).

The data below have been obtained empirically, i.e. without any attempt at estimating errors, and are useful to get an idea of the basic qualitative properties of the system. For $N = 2$ particles with $e_0 = 1/2$, $m = L = 1$, $E = 1$, density, $k$, $\ell$, and radii as above:



symbol                  quantity                       pbc, k = 1      ½pbc, k = 2
$\rho$                  density                        2               0.5
$\delta$                occup. vol.                    4.23·10^-4      4.23·10^-4
$\lambda_{max}^{-1}$                                   1.32            1.31
$\lambda_{min}^{-1}$                                   1.53·10^1       2.16·10^1
$\vartheta^{-1}$        $\forall\tau$                  < 20            < 20
$t_0$                   timing pace                    1.50·10^-1      1.35·10^-1
$\nu$                   collision rate                 1.09·10^-2      4.93·10^-3
                        entropy prod. / deg. freedom   1.75·10^-2      1.49·10^-2

(2.8)



where only the errors (not shown, but amounting to less than 0.1%; see below) on $t_0$, $\langle\sigma\rangle_+$ have been measured with care since we need them in our experiments. The other data are purely indicative of the orders of magnitude; $\forall\tau$ means for all values of $\tau$ considered below.
For 10 particle systems:







symbol                  quantity                       pbc, k = 1      ½pbc, k = 2
$\rho$                  density                        10              0.5
$\delta$                occup. vol.                    2.11·10^-3      4.23·10^-4
$\lambda_{max}^{-1}$                                   3.50            4.71
$\lambda_{min}^{-1}$                                   3.12·10^2       2.20·10^3
$\vartheta^{-1}$        $\forall\tau$                  < 20            < 20
$t_0$                   timing pace                    2.87·10^-2      2.8·10^-2
$\nu$                   pair collisions                8.95·10^-2      8.92·10^-3
                        entropy prod. / deg. freedom   4.07·10^-3      3.52·10^-3

(2.9)



with the same comments on the errors and the symbols as presented in (2.8).

We shall study the probability distribution $\pi_\tau(p)\,dp$, in the stationary state $\mu$, of the variable $p$ that is defined by (2.6), (2.7) above for $\tau$ large. In fact the theory of the chaotic hypothesis foresees that, if $\tau$ is large compared to $\lambda_{min}^{-1}$, then $\pi_\tau(p)$ verifies:

$$\log\frac{\pi_\tau(p)}{\pi_\tau(-p)} = \tau t_0\langle\sigma\rangle_+\,p \qquad (2.10)$$

This is the content of the fluctuation theorem discussed in [GC1], [GC2], [G2], [G1] and it means that the odd part of $\log\pi_\tau(p)$ is linear in $p$ with an a priori determined slope. Nothing is known about the even part.
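The statement that only the odd part of $\log\pi_\tau(p)$ is constrained can be illustrated with a few lines of code. In the sketch below, which is ours and purely illustrative, $\log\pi_\tau(p)$ is written as an arbitrary even function plus $\frac12 c\,p$, with $c$ standing for $\tau t_0\langle\sigma\rangle_+$; the ratio $\frac1c\log\frac{\pi_\tau(p)}{\pi_\tau(-p)}$ then equals $p$ whatever the even part is. The value of `c` and both even parts are hypothetical choices.

```python
c = 3.0  # hypothetical value of tau * t0 * <sigma>_+


def quadratic_even_part(p):
    # Even part of a Gaussian log density, up to an additive constant.
    return -0.25 * c * (p * p + 1.0)


def quartic_even_part(p):
    # A non Gaussian even part, chosen arbitrarily for illustration.
    return -0.1 * p ** 4 - 0.5 * p * p


def log_density(p, even_part):
    """log pi(p) = even part + odd part, the odd part being c * p / 2."""
    return even_part(p) + 0.5 * c * p


def fluctuation_ratio(p, even_part):
    """(1/c) * log[pi(p) / pi(-p)], predicted to equal p."""
    return (log_density(p, even_part) - log_density(-p, even_part)) / c


points = (0.25, 0.5, 1.0, 2.0)
gaussian_check = [fluctuation_ratio(p, quadratic_even_part) for p in points]
quartic_check = [fluctuation_ratio(p, quartic_even_part) for p in points]
print(gaussian_check, quartic_check)
```

Both lists reproduce the points themselves up to rounding, so a linear odd part carries no information on whether the distribution is Gaussian.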

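One may think that the even part is proportional to $p^2$, i.e. $\log\pi_\tau(p) = -\frac14(p-1)^2\,\tau t_0\langle\sigma\rangle_+$ up to an additive constant; therefore it is interesting to check whether the kurtosis:

$$\kappa_\tau = \frac{\langle(p-1)^4\rangle_+ - 3\langle(p-1)^2\rangle_+^2}{\langle(p-1)^2\rangle_+^2} \qquad (2.11)$$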

vanishes, if $\langle\cdot\rangle_+$ denotes average with respect to the SRB distribution $\mu$ in (1.1) (recall that $\kappa_\tau = 0$ for a Gaussian distribution and the value $\kappa_\tau$ can be taken as a quantitative dimensionless estimate of the "non gaussian" nature of the distribution). The central limit theorem for transitive Anosov systems, [S], implies a Gaussian distribution for the variable $\sigma_\tau$ for large $\tau$; this yields information about the deviations of $\tau\sigma_\tau$ from its average $\tau\langle\sigma\rangle_+$ by quantities of order $\sqrt\tau$; but (2.10) describes properties of large deviations, proportional to $\tau$. Therefore there is no a priori reason to expect that $\pi_\tau(p)$ is Gaussian; hence we do not expect that $\kappa_\tau = 0$; and the evaluation of $\kappa_\tau$, once (2.10) is established, is of considerable interest.
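For concreteness, the kurtosis (2.11) can be estimated from a finite list of measured values of $p$ as in the following minimal sketch. It replaces the SRB averages by sample averages, which is an assumption of the illustration, and the two small data sets are invented only to exercise the formula.

```python
def kurtosis_about_one(samples):
    """Sample version of (2.11), with moments taken about the value p = 1."""
    n = len(samples)
    m2 = sum((p - 1.0) ** 2 for p in samples) / n
    m4 = sum((p - 1.0) ** 4 for p in samples) / n
    return (m4 - 3.0 * m2 * m2) / (m2 * m2)


# Invented data: a two point distribution and a three point one.
two_point = [0.0, 2.0, 0.0, 2.0]
three_point = [-1.0, 1.0, 1.0, 3.0]
print(kurtosis_about_one(two_point), kurtosis_about_one(three_point))
```

A Gaussian sample would give a value near zero; the invented samples above are far from Gaussian and give negative values.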
Note that the variable $\sigma_\tau$ varies on a finite range, at fixed $N$. This means that $p$ varies in a finite interval $[-p^*, p^*]$, symmetric by the time reversal symmetry, whose size can be easily measured and an idea of its value can be obtained from the following rough data for $\tau = 20$:

         pbc, N = 2     ½pbc, N = 2     pbc, N = 10     ½pbc, N = 10
$p^*$    9.91           9.25            7.92            8.55              (2.12)

that give the values of $p^*$ on the actually observed trajectories. The definition of $\alpha$ and (2.5), by using the Schwarz inequality, imply a bound:

(2.13)

if $e_0$ is the energy per particle and $t_{max}$ is the maximum time between collisions. And the bound is saturated when all the particles have the same velocity parallel to the field.

Finally the result (2.10) is valid in the limit $\tau\to\infty$ and this means that, in order to check it, one has to perform many experiments with various values of $\tau$: one expects that $\tau$ should be large compared to $\lambda_{min}^{-1}$, at least at fixed $p$ (but the errors are not expected to be uniform in $p$ so that one should not be surprised to see still corrections at large values of $p$ for values of $\tau$ for which (2.10) holds without appreciable corrections at small values of $p$).

In this experiment we have computed the distribution $\pi_\tau(p)$ at fixed $\tau$ (using the discrete evolution $S$) by measuring $p$ over time intervals of length $\tau$ but spaced by a fixed number of collisions $\Delta$ during which no measurement is made. The time interval $\Delta$ has been taken large compared to the relevant characteristic times of the system, i.e. the average free flight time or the inverse of the time decay constant for the entropy autocorrelation function. The latter times being of the same order of magnitude and of the order of 1 to 10 collision times, we took $\Delta = 50$ collisions, see (2.8), (2.9). Larger $\Delta$ would have been better: but we would lose statistics. We then assume, in the statistical analysis, that the data so obtained are uncorrelated.
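The sampling procedure just described can be sketched as follows. This is not the code used for the experiments: it is a minimal illustration that takes a list of per collision contraction values, averages them over windows of `tau` collisions, skips `delta` collisions between consecutive windows, and normalizes by an estimate of the average contraction. Using the mean of the whole series as that estimate is an assumption of the sketch, and the input series is invented.

```python
def sample_p(sigma_series, tau, delta):
    """Values of p from windows of tau collisions separated by delta skipped ones."""
    mean_sigma = sum(sigma_series) / len(sigma_series)  # estimate of the average
    samples = []
    start = 0
    while start + tau <= len(sigma_series):
        window = sigma_series[start:start + tau]
        sigma_tau = sum(window) / tau          # windowed average, as in (2.6)
        samples.append(sigma_tau / mean_sigma)  # normalization, as in (2.7)
        start += tau + delta                    # skip delta collisions
    return samples


# Invented periodic series 0, 1, 2, 3, 0, 1, ... with mean 1.5.
series = [float(k % 4) for k in range(40)]
aligned = sample_p(series, tau=4, delta=4)
short = sample_p(series, tau=2, delta=0)
print(aligned, short)
```

With `tau=4` every window covers a full period of the invented series, so all values of `p` equal 1; with `tau=2` they alternate around 1 while keeping average 1.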

We made some empirical tests that this time delay was sufficiently large by investigating, in various cases, how important the time correlations were in the evaluation of the errors. Not unexpectedly we found that the statistical errors decrease by increasing the sampling delay $\Delta$, in spite of the smaller statistical samples: so as far as statistical errors are concerned there is some (small) advantage in taking measurements spaced by $\Delta$ large. But mainly this was a test of our independence assumption used in the error theory. If there had been a drastic change in the statistical errors we would have concluded that the correlation time for the entropy production was not of the order of 10 collision times.

Another time scale of interest is $\lambda_{min}^{-1}$: this is however very difficult to measure and it may have value $+\infty$ for some values of $E$ (see Fig. 15 below). There is no evidence that this time scale is related to the entropy autocorrelation, which appears to be short ranged, as far as we can see. On the other hand there is evidence that as $E$ grows the number of positive Lyapunov exponents decreases. In §6 we try to establish a connection between this observation and the fluctuation theorem.

This is very important for us as it shows that the chaotic hypothesis may hold in a far stronger sense than originally meant in [GC1], [GC2]. One can see that if the dimension of the stable manifold of the attractor points is not equal to that of the unstable manifold, then the closure of the attractor for the forward motion cannot be the same as that of the attractor for the backward motion. Hence the time reversal cannot leave the attractor invariant but the chaotic hypothesis, as formulated in §1, tells us that nevertheless the motion on the attractor is reversible in the sense that there is a map $i^*$ which leaves the attractor invariant and changes $S$ into $S^{-1}$. The map $i^*$ (whose existence is far from obvious) will be called the local time reversal on the attractor, see §6, [BG].

§3. Ising model analogy.

The fluctuation theorem as expressed by (2.10) and the subsequent comments on the Gaussian nature of the function $\pi_\tau(p)$ may seem somewhat strange and unfamiliar.

It is therefore worth pointing out that the phenomenon of a "linear fluctuation law" on the odd part of the distribution, in the sense of (2.10), without a globally Gaussian distribution, is in fact well known in statistical mechanics and probability theory. And an example of what the fluctuation theorem means in a concrete case, in which $\pi_\tau(p)$ is not Gaussian, can be made by using the Ising model on a 1 dimensional lattice $\mathbb Z$.

We consider the space $C$ of the spin configurations $\underline\sigma = \{\sigma_j\}$, $j\in\mathbb Z$, and the map $S$ that translates each configuration to the right (say). The "time reversal" is the map $i: \{\sigma_j\}\to\{-\sigma_j\}$ that changes the sign of each spin.

The probability distribution that approximates the SRB distribution is the finite volume Gibbs distribution:

$$\frac{\exp\left(J\sum_{j=-T}^{T-1}\sigma_j\sigma_{j+1} + h\sum_{j=-T}^{T}\sigma_j\right)}{\text{normalization}} \qquad (3.1)$$

where $\Lambda = [-T,T]$ is a large interval, $J, h > 0$. The configuration $\underline\sigma$ outside $\Lambda$ is distributed independently of the one inside the box $\Lambda$, to fix the ideas.

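Calling $\langle m\rangle_+$ the average magnetization in the thermodynamic limit we define the magnetization in a box $[-\frac{\tau}{2},\frac{\tau}{2}]$ to be $M_\tau = \tau\langle m\rangle_+ p$ and we look at the probability distribution of $p$ in the limit $T\to\infty$. The Gibbs distribution corresponding to the limit of (3.1) will play the role of the SRB distribution. Calling this limit probability $\pi_\tau(p)$ it is easy to see that:

$$\frac{1}{\tau\langle m\rangle_+}\log\frac{\pi_\tau(p)}{\pi_\tau(-p)} \xrightarrow[\tau\to\infty]{} 2hp \qquad (3.2)$$

This is in fact obvious if we take the two limits $T\to\infty$ and $\tau\to\infty$ simultaneously by setting $T = \frac{\tau}{2}$. In such a case, if $\sum_{\underline\sigma,\,p}$ denotes summation over all the configurations with given magnetization in $[-T,T]$, i.e. such that $\sum_{j=-T}^{T}\sigma_j = \tau\langle m\rangle_+ p$, the distribution (3.1) gives us immediately that:

$$\frac{\pi_\tau(p)}{\pi_\tau(-p)} = \frac{\sum_{\underline\sigma,\,p}\exp\left(J\sum_{j=-T}^{T-1}\sigma_j\sigma_{j+1} + h\sum_{j=-T}^{T}\sigma_j\right)}{\sum_{\underline\sigma,\,-p}\exp\left(J\sum_{j=-T}^{T-1}\sigma_j\sigma_{j+1} + h\sum_{j=-T}^{T}\sigma_j\right)} = e^{2\tau h\langle m\rangle_+ p} \qquad (3.3)$$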



if we use the symmetry of the pair interaction part of the energy under the "time reversal" (i.e. under spin reversal).
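The spin reversal argument can be verified by brute force on a short chain. The sketch below is an illustration of ours: it enumerates all configurations of an open Ising chain (the whole chain plays the role of the box, i.e. the case in which the two limits are taken together), accumulates the Gibbs weight of each total magnetization $M$, and compares $\log$ of the ratio of the weights of $M$ and $-M$ with $2hM$. The chain length and the values of `J` and `h` are arbitrary.

```python
import itertools
import math


def magnetization_weights(n_sites, J, h):
    """Unnormalized Gibbs weight of each total magnetization on an open chain."""
    weights = {}
    for spins in itertools.product((-1, 1), repeat=n_sites):
        pair_energy = sum(spins[j] * spins[j + 1] for j in range(n_sites - 1))
        total = sum(spins)
        weight = math.exp(J * pair_energy + h * total)
        weights[total] = weights.get(total, 0.0) + weight
    return weights


J, h = 0.4, 0.3                      # arbitrary couplings, both positive
weights = magnetization_weights(n_sites=10, J=J, h=h)
log_ratios = {M: math.log(weights[M] / weights[-M])
              for M in weights if M > 0}
for M in sorted(log_ratios):
    print(M, log_ratios[M], 2.0 * h * M)
```

The two printed columns agree up to rounding for every $M$, although the distribution of $M$ itself is not Gaussian on such a short chain.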

The error involved, in the above argument, in taking $T = \frac{\tau}{2}$ rather than first $T\to\infty$ and then $\tau\to\infty$, can be easily corrected since the corrections are "boundary terms", and in one dimensional short range spin systems there are no phase transitions and the boundary terms have no influence in the infinite volume limit (i.e. they manifest themselves as corrections that vanish, as $T\to\infty$ followed by $\tau\to\infty$).

One may not like that the operation $i$ commutes with $S$ rather than transforming it into $S^{-1}$. Another example in which the operation $i$ does also invert the sign of time is obtained by defining $i$ as $(i\underline\sigma)_j = -\sigma_{-j}$: then (3.3) can be derived also by using this new symmetry operation.

The above examples show why there is a priori independence between any Gaussian property of $\pi_\tau(p)$ and the fluctuation theorem. The theory of the fluctuation theorem in [GC1] is in fact based on the possibility (discovered in [S]) of representing a chaotic system as a one dimensional short range system of interacting spins (in general of spin higher than $\frac12$); and the argument is, actually, very close to the above one for the Ising model with, however, a rather different time reversal operation. See [G2] for mathematical details on the boundary condition question.

§4. Experimental results.



We have made several measurements. Each measurement is quite delicate and time consuming, hence the reader will probably forgive us for not having done all the experiments that one finds natural to do. Each experiment requires several days of CPU time on the computers that we used (and several months to prepare the final runs). The statistical errors have been measured by 3 times the standard deviation, and the other errors are estimated by following the criteria discussed in [GG] and summarized in the appendix. Thus the experiments should be reproducible within our error bars if the latter are defined as we do. Hopefully the data we give can be of use even if one decides (for mathematical reasons or to test other theoretical ideas) to change the assignments of the errors. In all the following graphs the error bounds are always marked although sometimes they may not be visible.

4.1. Periodic boundary conditions, $N = 2$: The values of $R_1$, $R_2$ are respectively 0.2, 0.4; the particle radius is 0.005 and the electric field is fixed at $E = 1$, a value that seems to be quite large (see however §6). The qualitative data of the resulting evolution are given in the first column of the table (2.8). Other experiments at varying field intensity are described in §6.

4.1.1 The probability distribution $\pi_\tau(p)$: The evolution is studied over 1.08 • 10^ collisions. In Fig. 2 we give the graph of $\pi_\tau(p)$ for various $\tau$.



[Fig. 2: panels showing $\pi_\tau(p)$ versus $p$; recoverable panel titles: $\tau = 20$, $\tau = 40$]   (4.1)

Fig. 2: The histograms for $\pi_\tau(p)$ at various values of the observation time $\tau = 20, 40, 60, 100$. Each vertical bar is the error bar centered around the measured point. The dots on the axis mark the extremes of the interval where the observed data differ from 0 within the statistical error.



The error bars are very small particularly for the data at the edge of the observability interval because the data are very many. But the relative errors (not shown) are very small at the center and they are very large at the edges, of course.

We have attributed to each point on the above histograms a statistical error as explained in §5 below (essentially they have been supposed independent variables and they have been given an error of 3 times the standard deviation). In fact the analysis of dispersion of each value shows that the "law of large numbers" is obeyed, and the standard deviation $m_n(\tau) = \langle(p - \langle p\rangle_\tau)^n\rangle_\tau$, $n = 2$, and the third order deviation, $n = 3$, approach 0 with an apparent decay given by:

$$m_2(\tau) = -0.005(\pm 0.002) + 39.80(\pm 0.05)\,\frac{1}{\tau}$$
$$m_3(\tau) = -0.002(\pm 0.004) + 93.(\pm 1.5)\,\frac{1}{\tau^2} \qquad (4.2)$$



The error analysis leading to (4.2) follows the same scheme of [GG]. We can also use here 

the notion of "goodness" introduced in [GG] to measure how good a fit is (reproduced in the 
appendix below), and we measured the goodness of the above fits. We do not use the word 
"reliability test", and use "goodness" instead, to avoid inducing the reader to think that we 
rely on some standard error analysis. The goodness is 2.45 • 10~^ and 4.94 • 10~^ respectively. 
The results are given in the following Fig. 3:

Fig. 3: The decay of the 2nd and 3rd moments, as functions of $\frac{1}{\tau}$ or $\frac{1}{\tau^2}$, and the kurtosis.

The question of whether the probability distribution $\pi_\tau(p)$ is Gaussian is investigated by computing the kurtosis as a function of $\tau$: the latter quantity is a dimensionless parameter and it is related to the fourth moment: $\kappa = \frac{m_4 - 3m_2^2}{m_2^2}$. It can be fitted by a law:




k{t) = [0.00(±0.01) + 2.3(±0.3)-] x — 

T m2{T) 



(4.4)

The data are reported in Fig. 3. But, unlike the cases of $m_2(\tau)$ and $m_3(\tau)$, the goodness of this fit is 1.63 • 10~^ and it is comparable with the goodness of fits with other laws: the errors are too big so that many fits are "as good" (and also very good: this only shows that the notion of goodness has shortcomings as much as any other accuracy test).

We conclude that the distribution seems compatible with a Gaussian. Of course one expects a Gaussian behaviour for the small deviations, i.e. for $(p-1)\sim\tau^{-1/2}$: if one assumes that the system is Anosov (or just that it has an Axiom A attractor) then this follows, as a theorem, from the results of [S] (or [R2]).

However we know that it cannot be Gaussian beyond the range $O(\tau^{-1/2})$, because it has support between $\pm p^*$ with $p^* < +\infty$, see (2.12) and §3. Hence the apparent closeness to a Gaussian distribution might be an accident, that might disappear as $\tau$ increases. If it does not then this is an interesting question to examine, see §6.

In Fig. 4 we show the main quantity of interest here, i.e. the graph of:

$$x(p) = \frac{1}{\tau t_0\langle\sigma\rangle_+}\log\frac{\pi_\tau(p)}{\pi_\tau(-p)} \qquad (4.5)$$

versus $p$ (recall that $\langle\sigma\rangle_+$ is $(2N-1)j$ if $j$ is the electric current):

[Fig. 4: $x(p)$ versus $p$]   (4.6)

Fig. 4: The fluctuation theorem test for $\tau = 20, 40, 60, 100$. The dashed straight line is the theoretical prediction of the fluctuation theorem. The arrows mark the point at distance $\sqrt{\langle(p-1)^2\rangle}$ from 1. The error bars are inherited from the histogram data in Fig. 2.

The abscissae axis is discretized and the number of events in which $p$ falls in a given interval is taken as proportional to $\pi_\tau(p)$. Therefore Fig. 4 is a histogram.

All values corresponding to different $\tau$ collapse on a single straight line in the interval $p\in[0,1.5]$ in the worst cases (i.e. largest $\tau$). For higher values of $p$ (depending on $\tau$) the statistics becomes gradually more and more poor as the deviations from the mean value of $p$ become too large.

4.1.2. Conclusions: From the above data we infer that one cannot exclude that the distribution is Gaussian. Assuming that it is Gaussian then the fluctuation theorem predicts a standard deviation $m_2(\tau) = \frac{2}{\tau t_0\langle\sigma\rangle_+} = \frac{A}{\tau}$, see lines preceding (2.11), and the above experimental data give:

A = 37.96 ± .21 (Gaussian assumption)
A = 39.80 ± .05 (experiment)                    (4.7)

where the Gaussian value is computed from the experimentally measured $\langle\sigma\rangle_+$ and $t_0$ (to which we attribute a statistical error of 3 times the standard deviation), while the experimental value is computed from $m_2(\tau)$, see (4.2); the error on the second line is as implied by (4.2).
The standard deviations of p from the value 1 are for r = 20, 40, 60, 80, 100 respectively 1.41, 
0.99, 0.81, 0.70 and 0.63. Such values, multiplied by 3 (recall that our conventional statistical 
errors are 3 times the standard deviation), can be arbitrarily assumed to be the boundary 
between the small and the large deviations. 
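
The link between the Gaussian assumption and the constant $A$ can be made explicit in one line. The following is a sketch under the assumption that $\pi_\tau(p)$ is exactly Gaussian with mean 1 and variance $m_2$, with $A$ denoting the coefficient of $1/\tau$ in $m_2(\tau)$:

```latex
\pi_\tau(p) = \frac{1}{\sqrt{2\pi m_2}} \exp\left(-\frac{(p-1)^2}{2 m_2}\right)
\quad\Longrightarrow\quad
\log\frac{\pi_\tau(p)}{\pi_\tau(-p)} = \frac{(p+1)^2 - (p-1)^2}{2 m_2} = \frac{2p}{m_2} .
```

Equating the right hand side with the fluctuation theorem value $\tau t_0 \langle\sigma\rangle_+ p$ gives $m_2 = 2/(\tau t_0 \langle\sigma\rangle_+)$, i.e. $A = 2/(t_0 \langle\sigma\rangle_+)$.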

Note that, as already mentioned in §2, if the chaotic hypothesis is assumed then we know from
the theory of Sinai that the small deviations obey a Gaussian law as $\tau \to \infty$: but the theory in
general does not predict the standard deviation. The example of §3 is a good illustration, we
believe, of the situation: in that case too we have a good understanding of the odd part of the
distribution, but no grip on the even part and no reason to know a priori the distribution of
the magnetization (i.e. also its even part). The results may mean that the values of $\tau$ that we
reach are not large enough to test the validity of the central limit theorem. On the other hand
the fluctuation theorem that follows from the chaotic hypothesis is independent of the central
limit theorem (as the example in §3 suggests and illustrates) and therefore the above results
are, in our opinion, a good test of the chaotic assumption. We shall examine this point in more
detail in §6.

Of course the choice, mentioned above, of 3 standard deviations to measure the statistical
errors and the large deviations "threshold" is arbitrary. One could, as is very often done, decide
to use 1 standard deviation instead. Then we could say that we can go quite far into the large
deviation region, but on the other hand many error bars become too small to be compatible
with the data. This is a well known problem with all experiments and we can just report
that it appears also in the present experiment. It could only be solved by better experiments:
hence we decided to perform experiments in which the computing facilities available to us were
pushed further. The results will be described below (when discussing the case of semi-periodic
boundary conditions and N = 2, 10).

§4.2 Periodic boundary conditions, N = 10: The geometry is the same as in the previous
case. The values of $R_1, R_2$ are respectively 0.2, 0.4; the particle radius is 0.005 and the electric
field is fixed at $E = 1$. The qualitative data of the resulting evolution are given in the first column
of the table (2.9).

§4.2.1 The probability distribution $\pi_\tau(p)$: The evolution is studied over 7.57 • 10^ collisions. In
Fig. 5 we give the graph of $\pi_\tau(p)$ for various $\tau$:

Fig. 5: The distribution $\pi_\tau(p)$ for $\tau = 20, 40, 60, 100$.

and in Fig. 6 we give the standard deviation and the third order deviation, as in the previous
case, as functions of $\tau^{-1}$ and $\tau^{-2}$ respectively.

(4.9)

Fig. 6: The decay of the 2nd and 3rd moments, as functions of $\tau^{-1}$ and $\tau^{-2}$, and of the kurtosis as a function of $\tau^{-1}$.

The fits give:

$$\begin{aligned}
m_2(\tau) &= 0.009(\pm .005) + 25.24(\pm 0.4)\,\frac{1}{\tau} \\
m_3(\tau) &= -.0002(\pm .0003) + 54.3(\pm 1.5)\,\frac{1}{\tau^2}
\end{aligned} \qquad (4.10)$$

and the goodness of the above fits is 1.2 • lO""^ and 3.4 • ID""* respectively, for the data with 
$\tau > 25$. The data with $\tau < 25$ deviate from the above law and we interpret this as finite size
effects (expected from the theory but absent in the previous case already for $\tau > 25$).
In Fig. 6 the kurtosis graph is reported, for which we attempted a best fit as:

$$\kappa(\tau) = -0.01(\pm 0.02) - 4.8(\pm 0.9)\,\frac{1}{\tau} \qquad (4.11)$$

with a goodness of 4.03 • 10 The data are not many because the experiment is hard (in terms 
of CPU time). 
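
The fits of the moments to a law of the form $a + b\,\tau^{-k}$ are linear in the unknowns $a, b$. A minimal sketch is the following; it is an unweighted least squares fit, which is our simplifying assumption (the error analysis actually used is the one of the Appendix), and the function name is hypothetical.

```python
def fit_inverse_tau(taus, values, power=1):
    """Unweighted least-squares fit of values to a + b / tau**power; returns (a, b)."""
    xs = [1.0 / tau ** power for tau in taus]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(values) / n
    # Standard formulas for a straight line y = a + b * x in the variable x = tau**(-power).
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, values))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b
```

With `power=1` this corresponds to the form used for $m_2$ and for the kurtosis, with `power=2` to the one used for $m_3$.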

Finally the main quantity of interest, i.e. the graph of $x(p) = \frac{1}{\tau t_0 \langle\sigma\rangle_+} \log \frac{\pi_\tau(p)}{\pi_\tau(-p)}$ versus $p$:

(4.12)

Fig. 7: The graphs for the fluctuation theorem test for $\tau = 20, 40, 60, 100$. The dashed straight line is the theoretical
prediction of the fluctuation theorem. The arrows mark the point at distance $\sqrt{\langle (p - 1)^2 \rangle}$ from 1. The error bars are
inherited from the histogram data in Fig. 5.

The abscissa axis is discretized and the number of events in which $p$ falls in a given interval
is taken as proportional to $\pi_\tau(p)$. Therefore Fig. 7 is a histogram.

It is remarkable that the finite size effects, i.e. the manifestation of important deviations from
the linear law for "small" $\tau$ (as we interpret them), are here very clear: the case $\tau = 20$ does
not follow the scaling, in contrast to $\tau = 40, 60, 80, 100$ ($\tau = 80$ is not shown).
§4.1.4 Conclusions, N = 10: The case $N = 10$ is very similar to the case $N = 2$, as the
theory predicts. From the above data we cannot exclude that the distribution is Gaussian in
the observed range of values of $p$. Assuming that it is Gaussian then the theory gives a second
moment $m_2(\tau) = \frac{2}{\tau t_0 \langle\sigma\rangle_+} = \frac{A}{\tau}$, see (2.11), and the above experimental data give:

A = 25.82 ± .02 (Gaussian assumption)
A = 25.24 ± .4 (experiment)

where the theoretical value is computed from the experimentally measured $\langle\sigma\rangle_+$, $t_0$, while the
experimental value is computed from the graphs for $m_2(\tau)$. The error analysis is carried out along
the same lines as that of the case $N = 2$ above.

The standard deviations of $p$ from the value 1 are, for $\tau = 20, 40, 60, 80, 100$, respectively
1.05, 0.79, 0.66, 0.57 and 0.51. Such values, multiplied by 3, can be arbitrarily assumed
to be the boundary between the small and the large deviations.

The same comments made for the case $N = 2$ apply here and therefore the data, in our
opinion, yield a good test of the chaotic assumption in the case $N = 10$ too.

§4.3.1 Semi-periodic boundary conditions, N = 2: The results are very similar in the case 
of semi-periodic boundary conditions and the following two graphs give the experimental values 
over 10* collisions (with the walls, other particles or obstacles) for the probability distribution 
$\pi_\tau(p)$ with $\tau = 20, 100$; the $\pi_\tau(p)$ at intermediate values of $\tau$ have also been measured ($\tau =
40, 60, 80$) but we do not give the graphs.

(4.14)

Fig. 8: The histograms for $\pi_\tau(p)$ for $\tau = 20, 100$.



The errors are treated, naturally, as in the corresponding previous cases. 

The next two figures give the graphs of the $x(p)$ in (4.5), which is expected to be the graph of
$x(p) = p$. Again we give only the two extreme cases, $\tau = 20, 100$, that we have been able to measure:
the agreement with the "theory" is equally excellent in the intermediate cases ($\tau = 40, 60, 80$). The
errors are slightly smaller than in the previous experiments because of the difference in the
length of the computed trajectory.

(4.15)

Fig. 9: The graphs for the fluctuation theorem test, case of $\frac12$(pbc), $N = 2$; the dashed line is the fluctuation theorem
prediction, $\tau = 20, 100$. The arrows mark the point at distance $\sqrt{\langle (p - 1)^2 \rangle}$ from 1.

We have also measured the kurtosis and the decay of the second and third moments: the results
are very similar to the ones in the periodic boundary conditions case and can be commented on in
the same way.



(4.16)

Fig. 10: The second moment, the third moment and the kurtosis.

The fits are:

$$\begin{aligned}
m_2(\tau) &= -0.00(\pm 0.002) + 45.1(\pm 0.01)\,\frac{1}{\tau} \\
m_3(\tau) &= 0.001(\pm 0.004) + 122.6(\pm 10.4)\,\frac{1}{\tau^2} \\
\kappa(\tau) &= -0.009(\pm 0.04) + 2.7(\pm 0.8)\,\frac{1}{\tau}
\end{aligned} \qquad (4.17)$$



and the goodness (see above, and the Appendix, for the definition) are 3-10 ^,2-10 ^ and 
1 • 10~^, respectively. 






As in the previous cases we can assume that $\pi_\tau(p)$ is a Gaussian and we see that the results are:

A = 43.8 ± 0.03 (Gaussian assumption)
A = 45.1 ± 0.01 (experiment)

(4.18)



As in the case of periodic boundary conditions the distribution is quite close to a Gaussian, 
although it cannot be such, strictly speaking (for the reasons already discussed). 

§4.3.4 Semi-periodic boundary conditions and 10 particles: We have also considered a further
$N = 10$ particles system, but at a density quite different from the previous one.

As in the previous case we need to take $\tau$ rather large to see the fluctuation law $x(p) = p$, see
(4.5), hold. But it turns out that although at $\tau = 20$ "finite size effects" are still visible and
very strong, already at $\tau = 40$ they become essentially invisible.

Therefore we give the histogram for $\pi_\tau(p)$ at $\tau = 20, 40, 60, 100$:

(4.19)

Fig. 11: Histograms for $\pi_\tau(p)$ at $\tau = 20$ and $\tau = 100$.

and the graphs of $x(p)$ at $\tau = 20, 40, 80, 100$:

Fig. 12: The linear fluctuation test, $\tau = 20, 40, 80, 100$. The dashed line is the fluctuation theorem prediction for
$\tau = +\infty$. The arrows mark the point at distance $\sqrt{\langle (p - 1)^2 \rangle}$ from 1.

which illustrate well the fact that there is a visible finite size effect that very soon becomes
unobservable.

The experiments were carried out over 10^ collisions. The following graphs give the kurtosis 
and the decay of the second and third moments. 
The corresponding fits are, excluding the last point that is clearly a finite size effect:

$$\begin{aligned}
m_2(\tau) &= -0.002(\pm 0.0005) + 30.7(\pm 0.1)\,\frac{1}{\tau} \\
m_3(\tau) &= 0.0001(\pm 0.001) + 84.3(\pm 11.3)\,\frac{1}{\tau^2} \\
\kappa(\tau) &= -0.00(\pm 0.07) + 2.3(\pm 5.7)\,\frac{1}{\tau}
\end{aligned} \qquad (4.22)$$


and the goodness are 2-10 ^,1-10 ^ and 1-10 ^, respectively. 

As in the previous cases if we assume that $\pi_\tau(p)$ is a Gaussian we see that the results are:

A = 29.2 ± 0.002 (Gaussian assumption)
A = 30.7 ± 0.1 (experiment)

(4.23)



§5. Error analysis of the distribution $\pi_\tau(p)$.

A brief description of the methods that we use to define the errors follows.
For a given $\tau$, we build a time sequence of different $p$ values:

$$p_n = p(x_{nt'}), \qquad n = 1, \ldots, M \qquad (5.1)$$

where $x_{nt'} = S^{nt'} x$, $t' = \tau/2 + A$, with $x$ randomly chosen in phase space with an absolutely
continuous distribution, and we have chosen $A = 50$ in order to decorrelate contiguous evolution
data points. The $\langle\sigma\rangle_+$ is fixed by normalization, so that the property $\sum_{n=1}^{M} p(x_{nt'}) = M$
holds, see (2.7).

That $A = 50$ is a sufficient delay is warranted by the size of the entropy autocorrelation
decay rate (reported in the tables of §2). In general we would expect that $A$ should be large
compared to the inverse of this rate. An accurate determination of the decay rate (including a verification of the validity of an
exponential law of decay, which is not a priori obvious: see the similar problems arising for the
one particle case in [GG]) is a major enterprise and we have only made a few empirical tests on
its order of magnitude: our choice $A = 50$ was dictated by purely numerical reasons
as a reasonable compromise between small statistics, accuracy and computer availability. It is
justified, in all our experiments, only by the empirically determined independence of the results
(apart from the size of the errors) in the cases $A = 20$ and $A = 50$. We reported only the
$A = 50$ results.
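
The sampling procedure can be sketched as follows. This is an illustration only: we assume that the entropy production is available as a list with one value per timing event, that each $p_n$ is the average over a window of $\tau$ consecutive values divided by $\langle\sigma\rangle_+$, and that consecutive windows start $t' = \tau/2 + A$ events apart as in (5.1); the function name is hypothetical.

```python
def sample_p(sigma, tau, delay, sigma_plus=None):
    """Return the values p_n computed on windows of tau consecutive events.

    sigma      : list of entropy production values, one per timing event
    tau        : window length (an even integer number of events)
    delay      : the decorrelation delay A; windows start tau // 2 + delay apart,
                 so they do not overlap as long as delay >= tau / 2
    sigma_plus : average of sigma; if None it is fixed by normalization,
                 i.e. by requiring that the p_n average to 1
    """
    step = tau // 2 + delay
    window_means = [
        sum(sigma[start:start + tau]) / tau
        for start in range(0, len(sigma) - tau + 1, step)
    ]
    if sigma_plus is None:
        sigma_plus = sum(window_means) / len(window_means)
    return [w / sigma_plus for w in window_means]
```

With `delay=50` the windows are disjoint for all $\tau \le 100$, which covers the values of $\tau$ considered here.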

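We define the discrete probability distribution:

$$\pi^r_\tau(l; M) = \frac{1}{M}\,\#\{\, n \le M : p_n \in I(l) \,\}, \qquad l = -100, \ldots, 100 \qquad (5.2)$$

where $I(l) = [lr, (l + 1)r]$ and $r = 1/10$. The relation between the discrete and the continuous
distribution when $M \to \infty$ is:

$$\pi^r_\tau(l; \infty) = \int_{I(l)} dp\, \pi_\tau(p) \qquad (5.3)$$

Assuming that $r$ is small enough we can expand the right hand side of (5.3):

$$r^{-1} \pi^r_\tau(l; \infty) = \pi_\tau(lr + r/2) + \frac{r^2}{24} \frac{\partial^2 \pi_\tau(p)}{\partial p^2}\Big|_{p = lr + r/2} + O(r^4) \qquad (5.4)$$

We have checked that this $O(r^2)$ correction is in all our cases negligible compared to other
sources of error: this has been done by approximating in (5.3) the function $\pi_\tau(p)$ by a Gaussian
distribution (which, a posteriori, is a good approximation to $\pi_\tau(p)$ in the range of $p$'s that we
study):

$$\pi_\tau(p) \simeq \frac{1}{\sqrt{2\pi a}} \exp\left(-\frac{(p - 1)^2}{2a}\right) \qquad (5.5)$$

where $a = m_2$ is the experimentally measured second moment (squared standard deviation) of the distribution. By substituting (5.5)
into the term with the second derivative in (5.4) we get:

$$r^{-1} \pi^r_\tau(l; \infty) = \pi_\tau(p) \left[ 1 + \frac{r^2}{24} \left( \frac{(p - 1)^2}{a^2} - \frac{1}{a} \right) \right] + O(r^4) \qquad (5.6)$$

with $p = lr + r/2$.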
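
The size of the discretization correction can be checked numerically. The sketch below compares, for a Gaussian with mean 1, the exact average of the density over one bin with the midpoint value and with the midpoint value plus the $r^2/24$ second derivative term. It assumes that the parameter `a` is the variance of the Gaussian; the function names are ours.

```python
import math


def gaussian_pdf(p, a):
    """Gaussian density with mean 1 and variance a."""
    return math.exp(-(p - 1.0) ** 2 / (2.0 * a)) / math.sqrt(2.0 * math.pi * a)


def bin_average_exact(l, r, a):
    """Exact (1/r) * integral of the Gaussian over [l*r, (l+1)*r], via the error function."""
    s = math.sqrt(2.0 * a)
    lo, hi = l * r, (l + 1) * r
    return 0.5 * (math.erf((hi - 1.0) / s) - math.erf((lo - 1.0) / s)) / r


def bin_average_expanded(l, r, a):
    """Midpoint value plus the r**2 / 24 second-derivative correction."""
    p = l * r + r / 2.0
    # Second derivative of the Gaussian density evaluated at the bin centre.
    second_derivative = gaussian_pdf(p, a) * ((p - 1.0) ** 2 / a ** 2 - 1.0 / a)
    return gaussian_pdf(p, a) + r ** 2 / 24.0 * second_derivative
```

For $r = 1/10$ and a variance of order 1 the correction is a relative effect of order $10^{-4}$ to $10^{-3}$ in the central bins.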

We also have to estimate the error involved in approximating $\pi^r_\tau(l; \infty)$ by $\pi^r_\tau(l; M)$. Let us
define

$$P_{\tau,l}(m; M) = \binom{M}{m}\, \pi^r_\tau(l; \infty)^m \left(1 - \pi^r_\tau(l; \infty)\right)^{M - m} \qquad (5.7)$$

where $P_{\tau,l}(m; M)$ is the probability that among $M$ elements of a sequence there are $m$ in the
interval defined by $l$. This means that we regard the various measurements as independent,
because the empirical autocorrelation of the entropy production decays on scales smaller
than $A$, the interval between successive measurements.

Then the average value of $m$ and its mean square displacement are given by:

$$\langle m \rangle_{\tau,l} = M \pi^r_\tau(l; \infty), \qquad \langle (m - \langle m \rangle_{\tau,l})^2 \rangle_{\tau,l} = M \pi^r_\tau(l; \infty) \left(1 - \pi^r_\tau(l; \infty)\right) \qquad (5.8)$$

When $M$ is large enough, we shall assume that:

$$\pi^r_\tau(l; \infty) = \pi^r_\tau(l; M) \pm 3 \left[ \frac{\pi^r_\tau(l; M) \left(1 - \pi^r_\tau(l; M)\right)}{M} \right]^{1/2} \qquad (5.9)$$

is an appropriate measurement of the error. Equations (5.6) and (5.9) are the relations used in
the distribution analysis.
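
The discrete distribution (5.2) with the error bars (5.9) can be computed along the following lines. This is a minimal sketch and not the program used for the experiments; the function name and the dictionary output are our choices.

```python
import math


def histogram_with_errors(samples, r=0.1, l_min=-100, l_max=100, n_sigma=3.0):
    """Return {l: (pi, err)} for the bins I(l) = [l*r, (l+1)*r] that contain events.

    pi  is the fraction of the M samples falling in I(l);
    err is n_sigma * sqrt(pi * (1 - pi) / M), the binomial error bar.
    """
    M = len(samples)
    counts = {}
    for p in samples:
        l = math.floor(p / r)
        if l_min <= l <= l_max:
            counts[l] = counts.get(l, 0) + 1
    result = {}
    for l, m in counts.items():
        pi = m / M
        result[l] = (pi, n_sigma * math.sqrt(pi * (1.0 - pi) / M))
    return result
```

The default `n_sigma=3.0` reproduces the convention of 3 standard deviations adopted here; passing `n_sigma=1.0` gives the alternative convention mentioned in §4.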

The error analysis of the fits for $m_2$, $m_3$, $\kappa$ is the same as the one used in [GG] and it is
repeated, with some minor changes to adapt it to the present situations, in the Appendix
below.

§6. Conclusions. Outlook. 

(i) Consistency and CODES: The above experiments, see Fig. 4, 6, 9, 12, seem in "good" agreement
with the theoretical predictions. We have attempted a very accurate test, compatibly
with our computer resources and the need of a natural time cut-off on the duration of the experiments.
The only real limitation was the computer time available: not so much that available
to us as that available to present day technology. Even by using the largest existing computers our results do
not seem to be substantially improvable: the motion being chaotic, there is not much that one
can really do without really new ideas.

Our attitude is that the theory is general and it should apply to any system like the ones
described in (2.2), so in particular to our computer programs, which we refer to as the
CODES. For such dynamical systems our experiments are, by definition, exact and the systems
are also by construction "close" to (2.2).

Therefore on such grounds we could say that the only errors in our theory
are the statistical errors, i.e. our experiment is as "perfect" as one could wish.

Nevertheless, not surprisingly, the situation is more subtle. The reason is mainly that we have
been unable to write CODES which "solve" (2.2) (in some sense, that is not very relevant for
us here) and which at the same time verify the time reversibility property.
We think that this is a serious flaw, because the theory, see [GC2], rests on time reversal.
Strictly speaking, then, we should apply the first part of the chaotic hypothesis and only
assume that the attractor verifies Axiom A.

However for this we have no theory: the only argument that one could give, and which we do
not find convincing, is that the CODES are certainly "close" approximations to (2.2), and we
are used to thinking that close systems behave closely. The abundance of counterexamples has
not deterred people from having the feeling that there is some truth to such a belief (natura non facit
saltus).

But since it might be simply impossible to write reversible, energy preserving CODES, what
we have done looked to us to be the only possibility we had to test the principle.

It would be interesting to carry out experiments analogous to the ones described above for
other types of systems for which reversible algorithms can be written and implemented, at least
in the conservative case, as discovered in [LV]: it is unclear, however, if they can be extended
to cover the dissipative cases and, if so, if this can be done with the accuracy necessary for the
test of the chaotic hypothesis.*

* The important paper [LV] had escaped our attention until very recently, too late to take it into account: it proves
that reversible codes do exist for systems not too far from ours and it is certainly important, and probably possible,
to try to adapt them to test the chaotic hypothesis with truly reversible CODES.
(ii) The pairing rule, Axiom A attractors and reversibility: If the chaotic hypothesis, in the
form given in §1 for the reversible case, is interpreted as meaning that the system behaves as
a transitive Anosov reversible system, with the time reversal operation being the "global time
reversal" $i$, then it implies that the stable and unstable manifolds have the same dimension.
So we would expect to have as many positive as negative Lyapunov exponents.
However the existence of the phenomenon in which, as $E$ grows, some Lyapunov exponent that is
> 0 at small $E$ becomes < 0 at larger $E$ was already pointed out in [DPH]. And tests showed, see
Fig. 14 and Fig. 15 below, that in the case of 10 particles with semi-periodic boundary conditions
the 19-th (out of the 38) Lyapunov exponent is likely to be < 0.

Fig. 14: The 38 Lyapunov exponents for $N = 10$ and $\frac12$(pbc). The small picture is an enlargement of the tail of the larger
one and it shows more clearly the pairing rule and that the 19-th exponent is slightly negative.

A simple test to see how reliably we have evaluated the Lyapunov exponents can be, nevertheless,
easily made: it consists in testing the pairing rule, discovered in [EM], [ECM1], verified to
an almost unthinkable accuracy in experiments by [DPH], and recently proved for cases that
include precisely our systems, see [DM]. Although we did not push our study to check the
pairing with the remarkable precision reached in [DPH] we find that the pairing rule is obeyed.

We stress, however, that the program we use to compute the Lyapunov exponents is very
close (although not identical) to the actual scheme followed in [DM] to prove the pairing rule:
therefore the test does not provide a test of the accuracy of the exponents. The pairing rule
has in fact to be fulfilled with high precision even if the precision reached in measuring the
exponents is not comparable (in other words the errors that one makes on each exponent of a
pair compensate exactly, see [DM], if they are due to the shortness of the runs).
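
A pairing test of this kind can be sketched as follows: the exponents are ordered, the $j$-th largest is paired with the $j$-th smallest, and the spread of the pair sums is examined. The function names are hypothetical and the sketch says nothing about how the exponents themselves are computed.

```python
def pairing_sums(exponents):
    """Return the sums of the j-th largest and the j-th smallest exponent, j = 1, 2, ..."""
    ordered = sorted(exponents, reverse=True)
    n = len(ordered)
    return [ordered[j] + ordered[n - 1 - j] for j in range(n // 2)]


def pairing_spread(exponents):
    """Difference between the largest and the smallest pair sum (zero if pairing is exact)."""
    sums = pairing_sums(exponents)
    return max(sums) - min(sums)
```

Note that a pair may well consist of two negative exponents without violating the rule: only the sum of the pair is constrained.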

The existence of one negative exponent in excess led us to study the question in more detail.

In Fig. 15 is shown the graph of the Lyapunov exponents $\lambda_j$, with $j = 13, \ldots, 19$, in the case of $\frac12$(pbc) and
10 particles for E = OA to E = 5.0 at steps of 0.1. The results are still "raw" in the sense that 
we have not yet been able to make a satisfactory study of the errors: the method we used is the 
"usual method", [BGGS], with a test trajectory of 0.5 • 10^ collisions. The errors are however 
quite large and more accurate experiments are necessary to confirm the raw data. 



(6.2)

Fig. 15: The 13-th through the 19-th Lyapunov exponents $\lambda_j$, plotted versus $E$, in the case $N = 10$, $\frac12$(pbc): raw data (no error bars
estimated). The pairing rule, [EM], [ECM1], [SEM], [DM], is verified.



The preponderance of the negative Lyapunov exponents over the positive ones does not seem
due, at least in the cases where it appears clearly, to the non-reversibility of our CODES: hence
the existence of more negative than positive exponents is very likely to be a real phenomenon
at large field (and fixed $N$).

Then one can ask the question: is there any reason to think that the fluctuation theorem might
be valid even if the attractor is "only" an Axiom A attractor?

We have not been able to test the fluctuation law at values of $E$ where there was an apparent
excess (we say "apparent" because our error analysis is not satisfactory, yet) of negative
Lyapunov exponents (say at $E = 2.5$ in the case of 10 particles and $\frac12$(pbc), where we see 2
negative Lyapunov exponents in excess, see Fig. 15). The reason is that the estimated time for
the corresponding numerical experiment is well beyond the present day computer capacities.
Large fluctuations may be very difficult to observe at very large fields.

We do not think that the fluctuation law can hold by chance: but we have seen it well verified
in a situation where there seems to be one negative Lyapunov exponent in excess over the
positive ones, see Fig. 12 (and Fig. 15 with $E = 1$). However from the analysis in [GC1], [GC2]
its validity appears very tightly related to the existence of a time reversal symmetry leaving the
closure of the attractor invariant, i.e. a map $i^*$ defined on the closure of the attractor and such
that $S i^* = i^* S^{-1}$.

Therefore we conclude that a not unreasonable scenario would be that, when there are pairs
of Lyapunov exponents that consist of two negative exponents, the attractor and its accumulation
points can be simply regarded as a smooth lower dimensional surface.* The motion
on this lower dimensional surface (whose dimension is smaller than that of phase space by an
amount equal to the number of paired negative exponents) will still have an attractor (with
dimension lower than the dimension of the surface itself, as suggested by the Kaplan-Yorke
formula, [ER]). And on such a manifold the motion will still be reversible in the sense that there
will be a map $i^*$ of the attractor into itself (certainly different from the global time reversal
map $i$) which inverts the time on the attractor and that can be naturally called a local time
reversal, see [BG].

* This does not preclude the possibility that the attractor has a fractal dimension (smoothness of the closure of an
attractor has nothing to do with its fractal dimensionality, see [ER], [GC1], [G1]).

Then manifestly one would be back with an Anosov system (on a lower dimensional manifold)
and a version of the fluctuation theorem would still hold. Furthermore one could say that this
is only a different interpretation of the chaotic principle of [GC1], [GC2] (which in such a case
does not even need to be reformulated to apply).

If this picture is correct we can write the phase space contraction rate (see (2.4), (2.5)) as $\sigma(x) =
\sigma_0(x) + \sigma_\perp(x)$, where $\sigma_0(x)$ is the contraction rate on the surface on which the attractor lies
and $\sigma_\perp(x)$ is the contraction rate of the part of the stable manifold of the attractor which is
not on the attractor itself (the angle between the part of the stable manifold sticking out of the
attractor and the attractor itself is disregarded here as we think that it is bounded away from
0 and $\pi$ since the attractor is compact).

Of course the local time reversal will change the sign only of $\sigma_0(x)$ and the fluctuation theorem
should apply to the fluctuations of $\sigma_0$. But $\sigma_0(x)$ is not directly accessible to measurement:
nevertheless we can still study its fluctuations via the following heuristic analysis. From the
proof in [DM] of the pairing rule one sees that the jacobian matrix J of the map S is such that 
\/j*J has D pairs of eigenvalues and the logarithms of each pair add up to J^'"^^ a{Qtx)dt (see 
also (2.4), (2.5)). 

The simplest interpretation of this, in view of the above proposed picture of the attractor,
is that the pairs with elements of opposite signs describe expansion on the manifold on which
the attractor lies, while the $M < D$ pairs consisting of two negative eigenvalues describe the
contraction of phase space in the directions transversal to the manifold on which the attractor
lies. Then we would have toao{x) = {D — M) j^^^^ a{Qtx)dt and we should have a fluctuation 
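law for the quantity $p(x)$ associated with $\sigma_0(x)$ defined by (2.6) and (2.7) with $\sigma_0$ replacing $\sigma$,
i.e. (accepting the above heuristic argument):

$$\sigma_{0\tau}(x) = \frac{D - M}{D} \langle\sigma\rangle_+ \, p(x) \qquad (6.3)$$

i.e. a law identical to (2.10) up to a correcting factor $1 - \frac{M}{D}$:

$$\log \frac{\pi_\tau(p)}{\pi_\tau(-p)} = \tau t_0 \langle\sigma\rangle_+ \left(1 - \frac{M}{D}\right) p \qquad (6.4)$$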

The graphs of Fig. 12 are relative to an experiment in which we see that there may be one
negative exponent in excess over the positive ones (as said above, see also Fig. 15): it is very
small (see Fig. 14), and it carries an error bar that we estimate to be so large as to allow for positive
values as well. The graphs, however, show that the agreement with the experiment of (6.4) is
within the errors: had there been no negative exponents in excess we would have expected a slope 1.
If there is one negative exponent in excess we expect a slope $1 - \frac{1}{19}$, which is within the error
bars in Fig. 12 (had we drawn in Fig. 12 the best fit line rather than the line with slope 1 the
agreement with the slope $1 - \frac{1}{19}$ would have been even better). An excess of 2 exponents would
yield a slope of $1 - \frac{2}{19}$, which is out of the error bars.
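
The slopes involved are elementary to tabulate. In the sketch below the correcting factor is taken to be $1 - M/D$, with $D$ the number of pairs of exponents ($D = 19$ for the 38 exponents of the 10 particle case) and $M$ the number of pairs consisting of two negative exponents; the function name is ours.

```python
from fractions import Fraction


def predicted_slope(n_pairs, n_negative_pairs):
    """Slope 1 - M/D of the fluctuation law, as an exact fraction."""
    return 1 - Fraction(n_negative_pairs, n_pairs)


# Slopes for no, one and two pairs of negative exponents out of 19 pairs.
slopes = [predicted_slope(19, m) for m in (0, 1, 2)]
```

The three values are 1, 18/19 and 17/19, i.e. about 1, 0.947 and 0.895: distinguishing the first two requires slope errors of a few percent at most.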

Note that since the exponent smallest in modulus is so small we must expect that it yields a 
clear effect only after extremely long times have elapsed {i.e. for values of r > 4. 10^: totally 
out of computability). 

The difficulty with the above scenario is that there is no a priori reason to think that the
attractors should have the above structure: i.e. fractal sets lying on smooth surfaces in phase
space on which the motion is reversible. However such a picture is very suggestive and it might
be applicable to more general situations in which the reversibility holds only on the closure of
the attractor and not in the whole space (like "strongly dissipative systems"). An attempt at
discussing this point can be found in [BG].

(iii) Smoothness of the closure of the attractor: The smoothness of the surface on which the
attractor lies, so that the attractor can be regarded, in itself, as an Anosov system, is very
likely not necessary, from a mathematical viewpoint. The more general assumption that the
attractor verifies Axiom A (and is transitive) would be sufficient if accompanied by the
(very strong) assumption that the attractor is mapped into itself by a symmetry $i^*$ such that on
the attractor $i^* S = S^{-1} i^*$ (discussed in (ii) above). We stress again that $i^*$ cannot be the same
as the global time reversal $i$ because the latter will map the attractor for the forward motion
into the one for the backward motion: however through simple examples of reversible motions
with attractor closures smaller than the whole phase space (hence verifying Axiom A but not
the Anosov property) one can see that the existence of $i$ often (always?) induces the existence
of a map $i^*$ on the attractor, see [BG].

We insist in talking about smoothness for the following two reasons: 

(1) because we think that the theory applies to general many particle systems and we cannot 
see the relevance of a possible fractional dimension over N = 10^® total dimensions: in other 
words we can proceed as if the dimension was integer (in [GC2] the fractality is called "an 
unfortunate accident" that may happen in the problems that we study), hence as if the system 
is Anosov, provided we accept that the attractor has a time reversal symmetry (i.e. there is a
map $i^*$ of the attractor into itself that anticommutes with the time evolution, $S_t i^* = i^* S_{-t}$).

(2) because, as it partially appears from Fig. 15, the Lyapunov exponents seem to evolve, as
a function of $E$, along the following pattern. At smaller $E$ they are all non zero (as shown
by Fig. 15): the dimension of the closure of the attractor is $4N - 2$ (i.e. that of the phase
space C). Then as $E$ grows one of them crosses continuously (and with non zero $E$ derivative,
i.e. transversally) the value 0 at some $E_1 > 0$, becoming negative for larger values of the field;
the closure of the attractor has now dimension $4N - 4$. At $E_2 > E_1$ a second Lyapunov
exponent crosses 0 (transversally) and the closure of the attractor has now dimension $4N - 6$,
etc. This suggests that, as $E$ varies, the attractor is characterized by more and more "constants
of motion", i.e. by the vanishing of more and more observables. Every time one more "constant
of motion" is born we see that the attractor loses two dimensions. Nothing suggests to us that
it becomes non smooth, or appreciably so.

One should also bear in mind that the above analysis is, necessarily, carried over in systems 
with few degrees of freedom. It might well be that the picture can considerably simplify at 
large N. See below for some thoughts on that point. 

(iv) Fluctuation theorem as a reversibility test: Since the very derivation, [GC1], [GC2], [G2],
of the fluctuation theorem is so intimately related to reversibility one could say that if the
predictions of the fluctuation theorem are verified, perhaps with a slope < 1 as suggested by
(6.4), then this is a sign that the dynamics on the attractor is reversible in the sense that it
is mapped into itself by a map $i^*$ such that $i^* S = S^{-1} i^*$ (hence $i \ne i^*$ unless the system is
Anosov and the attractor is the whole space). It is clear that the above considerations give rise
to several tests that can be experimentally performed.

Note that not only the linearity in $p$ is a strong statement, but also the predicted slope value
< 1 is somewhat surprising: naively one could be tempted to think that the contraction rate
transversal to the attractor is uniform over the attractor: this would lead to a slope > 1 (and
the larger the stronger is the attraction from the attractor).

(v) Gaussian? Central limit theorem? One may say, as a first reaction to the analysis, "of course
we expected a Gaussian distribution of the fluctuations because the dissipation over a time $\tau$ is a sum
of many terms that are statistically independent" if the motion is chaotic. Hence "everybody
(reasonable)" would expect a Gaussian distribution for the fluctuations. This would mean that
$(p - 1)$ has a dispersion of order $C\tau^{-1/2}$ for some (non trivial) constant $C$ related to the entropy
autocorrelation function $\langle\sigma(S^n x)\sigma(x)\rangle$:

00 

^' = 7^ E (M^"-M-)>+-(^)l) (6.5) 

^ ' ~l~ T),= — no 



And it would imply that: 

Ir^or , 

However this would not imply that ^ = to{a)+: which would mean that, in general, it is also: 



log — 7=^7^?' (6-6) 



(a+) = | J2 ((^(^"•)^(-))+ - (^)l) (6.7) 

n= — 00 

The latter would be a strange prediction in the context of the central limit theorems. Furthermore
the central limit theorem is expected to hold only for fluctuations of $p - 1$ of the order of
$\tau^{-1/2}$, while (2.10) compares the probabilities of $p$ to those of $-p$ which, for $p \sim 1$, describe
deviations of order $\tau$. In other words there is no relation between the usual central limit theorem
for $p$ and the fluctuation theorem (when they are expected to hold, i.e. for $\tau \to \infty$).
Therefore it came as a surprise to us that the results were instead consistent, on the whole range
of observability, with a Gaussian distribution. We would have thought that $\log \pi_\tau(p) = -\tau\zeta(p)$
(for $\tau$ large enough) with only the odd part of $\zeta(p)$ strictly proportional to $p$, while the even
part had no reason to be quadratic. And the example of §3 shows how natural it would be that
it is not quadratic. The constant $C$ would have been mainly determined by the curvature of $\zeta(p)$
at its maximum $p = 1$, totally unrelated (we thought) to $\langle\sigma\rangle_+$.

Naturally we expected (6.7) to hold for very small external field $E$: in the case $N = 1$ it was
indeed proved by [GELS] to hold for values of $E$ that, in our units, are extremely small compared
to 1: and the equation (6.7) is known for small field ("linear regime") as the "fluctuation
dissipation" relation (or "Green-Kubo formula"). This relation has been shown to be closely
related to (and essentially a consequence of) the chaotic hypothesis in reversible systems for $E$
close to 0 (see [G5] eq. (5.9), taking into account the special form of $\sigma$, its linearity in $E$ and
the fact that the energy is 1) still with corrections of $O(E^3)$.

Note that as $E \to 0$ the two sides of (6.7) both have size of order $O(E^2)$ and the fluctuation
dissipation theorem is the relation obtained by dividing the two sides by $E^2$ and letting $E \to 0$.
There should be a deeper reason for the relation between the small deviations constant $C$
(relevant in the fluctuations of scale $\sqrt{\tau}$) and the large scale fluctuations (of size $O(\tau)$) which
are related to $\langle\sigma\rangle_+$, see (2.10): this seems to be explained in [G6].
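
The sum over $n$ of the entropy autocorrelation, which enters both (6.5) and (6.7), can be estimated from a finite time series. The following is a minimal sketch, not the procedure used for the experiments: the function name `summed_autocovariance`, the truncation at a hypothetical `max_lag`, and the synthetic uncorrelated data are all assumptions made for illustration.

```python
import random


def summed_autocovariance(series, max_lag):
    """Estimate sum_{n=-max_lag}^{max_lag} (<s_n s_0> - <s>^2) from one series."""
    length = len(series)
    if not 0 <= max_lag < length:
        raise ValueError("max_lag must satisfy 0 <= max_lag < len(series)")
    mean = sum(series) / length
    centered = [s - mean for s in series]

    def autocov(lag):
        # Average of the centered products over all available pairs.
        pairs = length - lag
        return sum(centered[i] * centered[i + lag] for i in range(pairs)) / pairs

    # For a stationary series the terms with lag n and -n coincide.
    return autocov(0) + 2.0 * sum(autocov(lag) for lag in range(1, max_lag + 1))


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic, uncorrelated samples (assumption): mean 0.5, variance 1.
    sigma = [rng.gauss(0.5, 1.0) for _ in range(20000)]
    print("summed autocovariance:", summed_autocovariance(sigma, max_lag=20))
    print("sample mean:", sum(sigma) / len(sigma))
```

For uncorrelated data only the $n = 0$ term survives on average, so the estimate fluctuates around the variance. For real data the truncation `max_lag` has to exceed the correlation time, and the statistical noise of the estimate grows with it.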

In the "old" literature one can find statements that today sound somewhat mysterious like:

"It is empirically known that for macroscopic values of $\alpha$, i.e. for values of the $\alpha_i$ much larger
than their root mean square values at equilibrium, the averages of these quantities frequently
obey linear differential equations."

which is then used to establish that relations between properties that hold for small fluctuations
hold also for large ones, at least in the average. In [DGM], p. 100, the above statement is the
beginning of a classical derivation of the Onsager reciprocity relations. One may speculate,
since the chaotic hypothesis is related to the Onsager relations as well, [G5], that our "strange"
relation between the small and large deviations might be related to the fact that a similar
property can be taken as the basis of a derivation of Onsager's theory of reciprocity. Small and
large fluctuations seem to have some common properties that one may not expect a priori or,
at least, that one may consider worth challenging and testing, see [DGM], p. 102. This
point of view leads to the paper [G6] where it is developed and its consequences are drawn.

The connection between large and small fluctuations is, if at all present, likely to be a peculiarity
of models typical of statistical mechanics (short range interacting many particle systems): it
may not hold in other cases in which the chaotic hypothesis can be applied (sometimes leading,
nevertheless, to reciprocity relations of Onsager type) as in fluid dynamics models, [G4].

It is worth stressing again that we know that the distribution in $p$ cannot be Gaussian for all $p$'s,
because there is a maximum ($\tau$-independent) value that $p$ can take, just from the finiteness of
phase space. This value, called $p^*$ in [G2], can be easily measured in our experiments (see (2.12))
but it is very far away from the region where we have enough statistics to make meaningful
measurements, as it appears from the graphs reported above.

(vi) Time scales for large $N$: A somewhat more speculative scenario can be drawn for large $N$.
We mention it because we hope to receive some help in a program directed to test it. It seems
reasonable to us that in the thermodynamic limit the Lyapunov exponents of the system will
fall into two categories, each consisting of pairs of exponents. The sum of the values of each
pair is constant and equal to half the average entropy creation rate (this is the pairing rule, see
above).

A large number of pairs ($O(N)$) will consist of one vanishing exponent and its negative companion.
The remaining positive Lyapunov exponents will all be identical: marking the time scale
of local approach to equilibrium, hence also the other negative exponents will be identical (by
the pairing rule). By identical we mean here that the ratio between the largest and the smallest
positive Lyapunov exponents is bounded uniformly in $N$ (away from 0 and $\infty$).

This is in perfect agreement with fig. 5 of [DPH] describing a very high density system: it is
not in agreement with the other results of [DPH].

Nevertheless, as argued also in [G3], it is possible that the low density results are flawed
in this respect as one would need far too large systems to obtain Lyapunov exponents for a
distribution close to the thermodynamic limit distribution $\mu$. Only in the high density case does a
small sample of gas exhibit the features of a large sample.

The above picture merges with the ideas in [G4] relating the vanishing Lyapunov exponents to the
macroscopic modes described by macroscopic equations, while the nonzero exponents describe
the approach to local equilibrium. It also matches with Fig. 14 where a rather sharp drop of
the Lyapunov exponents towards 0 appears between the 10-th and the 11-th exponent.

Since it is always assumed that there is only one microscopic time scale for the local approach 
to equilibrium it is perhaps a natural conjecture that the Lyapunov exponents should have the 
above structure. 

(vii) We see that the above experiments raise perhaps more problems than expected: but the
chaotic hypothesis emerges as not inconsistent with the data.

Appendix. Fits and Errors.

This appendix is adapted to the present experiments from the corresponding appendix in
[GG]. We get data sets from our computer experiments, say $y(x) = \{y(x_i)\}_{i=1,N}$, where
$x = \{x_i\}_{i=1,N}$ is in our case an independent variable, say for instance a set of $N$ collision
numbers or time instants. To the latter experimental data set of points, we want to fit a given
guessed function, say $f(x; a)$, where $a = \{a_n\}_{n=1,P}$ is a set of arbitrary parameters. Here by
fit we mean to find a set of parameters $a^*$ which optimizes some reasonable functional relation
between the experimental data and the fitting function.
In our case we use the least squares functional, i.e.:

$$V(y(x), a) = \sum_{i=1}^{N} \left[ y(x_i) - f(x_i; a) \right]^2 \tag{A1.1}$$

The set of parameters $a^*(y)$ is here obtained by asking that they should minimize
the $V$ function: $\partial_{a^*} V(y, a^*) = 0$. We also define the goodness of our fit, $G$, by the average
$y$-distance of our data to the function $f(x; a^*)$: $G(y(x)) = (V(y(x), a^*)/N)^{1/2}$. This
parameter is only meaningful when it is compared with the one from another fit. Given many
fits, the one with smallest $G$ value will be called best fit (among the considered fits).
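
The definitions above can be illustrated with a short sketch. It assumes an unweighted sum of squares for $V$ and compares two hypothetical fitting functions, a straight line and a constant, on made-up data; the function names and the data are assumptions for illustration and are not taken from [GG].

```python
import math


def least_squares_value(xs, ys, f, a):
    """V(y, a): sum over the data of (y_i - f(x_i; a))**2."""
    return sum((y - f(x, a)) ** 2 for x, y in zip(xs, ys))


def goodness(xs, ys, f, a_star):
    """G = sqrt(V(y, a*) / N), the average y-distance of the data from the fit."""
    return math.sqrt(least_squares_value(xs, ys, f, a_star) / len(xs))


def line(x, a):
    """Linear model f(x; a) = a1 + a2 * x with a = (a1, a2)."""
    return a[0] + a[1] * x


def constant(x, a):
    """Constant model f(x; a) = a1 with a = (a1,)."""
    return a[0]


def fit_line(xs, ys):
    """Closed-form minimiser a* = (a1, a2) of V for the linear model."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    if sxx == 0.0:
        raise ValueError("all x values coincide: the slope is undetermined")
    slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / sxx
    return (y_mean - slope * x_mean, slope)


def fit_constant(xs, ys):
    """Closed-form minimiser of V for the constant model: the mean of y."""
    return (sum(ys) / len(ys),)


if __name__ == "__main__":
    xs = [0.0, 1.0, 2.0, 3.0, 4.0]
    ys = [1.1, 2.9, 5.2, 6.8, 9.1]  # made-up data, roughly 1 + 2 x
    g_line = goodness(xs, ys, line, fit_line(xs, ys))
    g_const = goodness(xs, ys, constant, fit_constant(xs, ys))
    # The fit with the smaller G is the "best fit" among the two considered.
    print("G (line):", g_line, " G (constant):", g_const)
```

Since $G$ is only meaningful in comparison, the example prints both values rather than judging either one in isolation.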
The data have, in general, non-negligible errors, say $\epsilon = \{\epsilon_i\}_{i=1,N}$, due to the finite number of
samples used in the averaging (see comments in §2). Such errors induce errors on the parameter
values. Therefore, a measure of the error amplitude in $a^*(y)$ is given by:

$$\Delta_\epsilon a^*(y) = \left[ a^*(y + \epsilon) - a^*(y - \epsilon) \right]/2 \tag{A1.2}$$

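In the particular case in which the magnitude of the data error is much smaller than the
measured value, $|\epsilon_i / y(x_i)| \ll 1$, we may expand the latter equation around $\epsilon = 0$:

$$\Delta_\epsilon a_n^*(y) \simeq \sum_{i=1}^{N} c_i^{(n)} \epsilon_i \tag{A1.3}$$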
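The coefficients $c_i^{(n)}$ are found by expanding $V(y(x) + \epsilon, a)$ around $a^*(y)$ and $\epsilon = 0$ and
they are given by:

$$c_i^{(n)} = -\sum_{m=1}^{P} (D^{-1})_{nm}\, \partial_{y_i} \partial_{a_m} V(y, a^*) \tag{A1.4}$$

where $D_{mn} = \partial_{a_m} \partial_{a_n} V(y, a^*)$.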

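In particular for the linear fit, $f(x, a) = a_1 + a_2 x$, the coefficients $c_i^{(1)}$, $c_i^{(2)}$ are given by:

$$c_i^{(1)} = \frac{1}{N}\left(1 - \frac{\bar x\,(x_i - \bar x)}{\Delta x}\right), \qquad c_i^{(2)} = \frac{x_i - \bar x}{N\,\Delta x} \tag{A1.5}$$

where $\Delta x = \overline{(x - \bar x)^2}$ and $\bar x = N^{-1}\sum_i x_i$ (the bar denotes the average over the $N$ data points).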
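
For the linear fit the minimiser $a^*(y)$ depends linearly on the data, so the symmetric difference (A1.2) and the linear expansion in the errors agree exactly. The sketch below checks this numerically. It is only an illustration: the coefficient formulas are derived here from the closed-form least squares solution (slope weights $(x_i - \bar x)/(N\,\Delta x)$, intercept weights $1/N - \bar x\,(x_i - \bar x)/(N\,\Delta x)$), and the function names, data and error values are assumptions.

```python
def fit_line(xs, ys):
    """Closed-form least squares fit of f(x; a) = a1 + a2 * x; returns (a1, a2)."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    if sxx == 0.0:
        raise ValueError("all x values coincide: the slope is undetermined")
    slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / sxx
    return (y_mean - slope * x_mean, slope)


def linear_fit_coefficients(xs):
    """Return (c1, c2) with c1[i] = d a1*/d y_i and c2[i] = d a2*/d y_i."""
    n = len(xs)
    x_mean = sum(xs) / n
    spread = sum((x - x_mean) ** 2 for x in xs) / n  # mean square deviation of x
    if spread == 0.0:
        raise ValueError("all x values coincide: the slope is undetermined")
    c2 = [(x - x_mean) / (n * spread) for x in xs]
    c1 = [1.0 / n - x_mean * c for c in c2]
    return c1, c2


def symmetric_difference(xs, ys, eps):
    """[a*(y + eps) - a*(y - eps)] / 2, component by component, as in (A1.2)."""
    plus = fit_line(xs, [y + e for y, e in zip(ys, eps)])
    minus = fit_line(xs, [y - e for y, e in zip(ys, eps)])
    return tuple((p - m) / 2.0 for p, m in zip(plus, minus))


if __name__ == "__main__":
    xs = [0.0, 1.0, 2.0, 3.0, 4.0]
    ys = [1.1, 2.9, 5.2, 6.8, 9.1]        # made-up data
    eps = [0.1, -0.2, 0.05, 0.0, 0.15]    # made-up data errors
    c1, c2 = linear_fit_coefficients(xs)
    print(symmetric_difference(xs, ys, eps))
    print(sum(c * e for c, e in zip(c1, eps)), sum(c * e for c, e in zip(c2, eps)))
```

For a fitting function that is nonlinear in the parameters the two quantities would agree only to leading order in the errors.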

The errors are random variables and we have to average them over their distribution. In 
all cases considered it seemed reasonable to consider the errors as independent variables. 
Therefore we empirically estimate an upper bound for their correlation values: 



$$|\langle\epsilon_i \epsilon_j\rangle| \le \sqrt{\langle\epsilon_i^2\rangle\,\langle\epsilon_j^2\rangle} \tag{A1.6}$$

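The parameter errors in our analysis are defined by the equations:

$$(\Delta a_n^*)^2 = \sum_{i=1}^{N} \sum_{j=1}^{N} |c_i^{(n)}|\,|c_j^{(n)}|\,\delta_i\,\delta_j \tag{A1.7}$$

where $\delta_i^2 = \langle\epsilon_i^2\rangle$ and their use and meaning is described entirely by the above comments.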
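
A minimal sketch of how such a bound could be evaluated is given below. It assumes that the parameter error is the upper bound obtained by inserting (A1.6) into the averaged square of the linear expansion in the errors, so that the double sum over $i, j$ factorises into $(\sum_i |c_i|\,\delta_i)^2$. The function names and the numbers are hypothetical, and the independent-error value is shown only for comparison.

```python
import math


def error_bound(coeffs, deltas):
    """Square root of the double sum, written in factorised form: sum_i |c_i| delta_i."""
    return sum(abs(c) * d for c, d in zip(coeffs, deltas))


def error_bound_double_sum(coeffs, deltas):
    """Same bound computed literally as sqrt(sum_ij |c_i| |c_j| delta_i delta_j)."""
    total = 0.0
    for ci, di in zip(coeffs, deltas):
        for cj, dj in zip(coeffs, deltas):
            total += abs(ci) * abs(cj) * di * dj
    return math.sqrt(total)


def error_if_independent(coeffs, deltas):
    """Error for strictly uncorrelated data errors: sqrt(sum_i c_i^2 delta_i^2)."""
    return math.sqrt(sum((c * d) ** 2 for c, d in zip(coeffs, deltas)))


if __name__ == "__main__":
    coeffs = [0.5, -0.25, 0.25]   # hypothetical coefficients c_i for one parameter
    deltas = [0.2, 0.4, 0.4]      # hypothetical root mean square data errors
    print("bound (factorised):", error_bound(coeffs, deltas))
    print("bound (double sum):", error_bound_double_sum(coeffs, deltas))
    print("independent errors:", error_if_independent(coeffs, deltas))
```

The bound is never smaller than the independent-error estimate, which is the sense in which it is a conservative choice when the correlations between the $\epsilon_i$ are not known.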